Segmented video codec for high resolution and high frame rate video

ABSTRACT

Embodiments disclosed herein provide systems, methods, and computer readable media for a segmented video codec for high resolution and high frame rate video. In a particular embodiment, a method of encoding a composite video stream provides identifying a first portion of an image of a video stream for encoding using first parameters and a second portion of the image of the video stream for encoding using second parameters. The method further provides segmenting the first portion of the image into one or more first tiles and the second portion of the image into one or more second tiles. The method further provides encoding the first tiles using the first parameters and the second tiles using the second parameters and, after the encoding, combining the first tiles and the second tiles into the composite video stream.

TECHNICAL BACKGROUND

Display screens continue to increase in resolution. Similarly, imagesdisplayed on those screens also increase in resolution in order to fullyrealize the benefits of the higher resolution displays. Regardless ofwhether these images are static or moving (e.g. video), increasing imageresolution typically increases both the size of the image files and theprocessing power needed to encode or decode the images. In other words,higher resolution images take up more storage space, use more bandwidthto transfer, and use more processing resources for encoding and decodingthan do lower bandwidth images. Moreover, depending on display size,there exists a distance from the display where higher resolutions ceaseto be discernable to the human eye. Similarly, in the case of videoimages, higher frame rates also increase the use of the above resources.

One standard display resolution is commonly referred to as 1080p or fullHD and has a resolution of 1920×1080. Newer displays, however, may havehigher resolutions including 3840×2160, which is commonly referred to as4K or Ultra HD. While Ultra HD allows for higher resolution images, duein part to the additional resources listed above, Ultra HD may not bedesirable in all situations. Moreover, the same image may includeportions that would benefit from Ultra HD and portions that would not.For example, some displayed information may have details that wouldbenefit from being shown in Ultra HD but other information may not,although, all the information is displayed at the same resolution. Inother examples, a certain parts of a video image contain motion thatrequires a higher frame rate than more static parts of the video imageand a processor encoding or decoding the video may not be able to handleUltra HD at high frame rates.

Overview

Embodiments disclosed herein provide systems, methods, and computerreadable media for a segmented video codec for high resolution and highframe rate video. In a particular embodiment, a method of encoding acomposite video stream provides identifying a first portion of an imageof a video stream for encoding using first parameters and a secondportion of the image of the video stream for encoding using secondparameters. The method further provides segmenting the first portion ofthe image into one or more first tiles and the second portion of theimage into one or more second tiles. The method further providesencoding the first tiles using the first parameters and the second tilesusing the second parameters and, after the encoding, combining the firsttiles and the second tiles into the composite video stream.

In some embodiments, the first parameters comprise a first frame rateand a first resolution and the second parameters include a second framerate and a second resolution.

In some embodiments, the first frame rate is higher than the secondframe rate and the first resolution is lower than the second resolution.

In some embodiments, the composite video stream comprises a video havingthe first frame rate and the second resolution.

In some embodiments, the method further provides transferring thecomposite video stream to a decoder system, wherein the decoder systemdecodes the composite video stream for display.

In some embodiments, the decoder system segments the composite videostream, based on tile segment information, into the first and secondtiles before decoding the first tiles based on the first parameters anddecoding the second tiles based on the second parameters.

In some embodiments, the method further provides transferring the tilesegment information to the decoder system using a supplementalenhancement information (SEI) message.

In some embodiments, the encoding is performed using the H.265 codec.

In another embodiment, a non-transitory computer readable medium isprovided having instructions stored thereon. When executed by an encodersystem, the instructions direct the encoder system to perform a methodof encoding a composite video stream. The method includes identifying afirst portion of an image of a video stream for encoding using firstparameters and a second portion of the image of the video stream forencoding using second parameters. The method further includes segmentingthe first portion of the image into one or more first tiles and thesecond portion of the image into one or more second tiles. The methodfurther includes encoding the first tiles using the first parameters andthe second tiles using the second parameters and, after the encoding,combining the first tiles and the second tiles into the composite videostream.

In yet another embodiment, a non-transitory computer readable medium isprovided having instructions stored thereon. When executed by an decodersystem, the instructions direct the decoder system to perform a methodof decoding a composite video stream. The method includes receiving thecomposite video stream, wherein the composite video stream comprises acombination of one or more first tiles of an image of a video streamencoded using first parameters and one or more second tiles of the imageof the video stream encoded using second parameters. The method furtherincludes segmenting the composite video stream into the first and secondtiles and, after segmenting, decoding the first tiles based on the firstparameters and decoding the second tiles based on the second parameters.After decoding, the method includes combining the first tiles and thesecond tiles for display of the image of the video stream.

BRIEF DESCRIPTION OF THE DRAWINGS

Many aspects of the disclosure can be better understood with referenceto the following drawings. While several implementations are describedin connection with these drawings, the disclosure is not limited to theimplementations disclosed herein. On the contrary, the intent is tocover all alternatives, modifications, and equivalents.

FIG. 1 illustrates an encoder operational scenario in an implementation.

FIG. 2 illustrates an encoding method in an implementation.

FIG. 3 illustrates a decoder operational scenario in an implementation.

FIG. 4 illustrates a decoding method in an implementation.

FIG. 5 illustrates a multi-client operational scenario in animplementation.

FIG. 6 illustrates image segmentation in an operational scenario.

FIG. 7 illustrates an image frame sequence in an operational scenario.

FIG. 8 illustrates a computing architecture in an implementation.

DETAILED DESCRIPTION

The following description and associated figures teach the best mode ofthe invention. For the purpose of teaching inventive principles, someconventional aspects of the best mode may be simplified or omitted. Thefollowing claims specify the scope of the invention. Note that someaspects of the best mode may not fall within the scope of the inventionas specified by the claims. Thus, those skilled in the art willappreciate variations from the best mode that fall within the scope ofthe invention. Those skilled in the art will appreciate that thefeatures described below can be combined in various ways to formmultiple variations of the invention. As a result, the invention is notlimited to the specific examples described below, but only by the claimsand their equivalents.

FIG. 1 illustrates a representative encoding operational scenario 100 inwhich video is segmented for high resolution and high frame rateencoding. In operational scenario 100, a video encoder 101 is provided,which includes image portion identifier module 111, image segmentationmodule 112, first parameters encoding module 113, second parametersencoding module 114, and combination module 115. Modules 111-115 may beimplemented by software, firmware, or other type of processinginstructions, that direct a processor system to perform thefunctionality disclosed herein, may be implemented in dedicated hardwarecircuitry, or may be implemented in some combination thereof.

In operational scenario 100, video having image elements that wouldbenefit from one or more encoding parameters (e.g. resolution, framerate, etc.) are encoded separately from elements having one or moredifferent encoding parameters. For example, a video image, which isproduced by generating a sequence of image frames, may be generated byincluding both video captured by a video camera and other visualinformation (e.g. text, writing, logos, photos, slides, whiteboard,etc.) outside of the captured video into a single video image. This maybe the situation when live captured video is overlaid on top of othervisual information and the resultant image is compiled into a videostream. The camera video may require a higher frame rate to ensuremotion in the video is smooth when viewing while the other informationmay be more static or otherwise not require the smoothness allowed by ahigh frame rate. Additionally, the captured video may be captured at alower resolution than the surrounding image information.

Based on the video image described above, encoding the entire videoimage at the higher frame rate of the captured video and the higherresolution of the other visual information would generate an encodedvideo having unneeded data. That is, the encoded video would have datarepresenting a greater resolution of the captured video than theresolution with which the video was captured. Likewise, the encodedvideo would have data representing more frames than are necessary topresent the other visual information. Accordingly, the encoded videowould be larger and require more bandwidth than would an encoded videowithout the unnecessary data. Likewise, the processing resources neededto encode the entire visual image at the higher frame rate and thehigher resolution are greater.

Operational scenario 100 therefore receives a video stream for encodingand image portion identifier module 111 identifies portions of the videoimage that can be encoded using one set of parameters, the firstparameters, and portions of the video that can be encoded using adifferent set of parameters, the second parameters. The parameters eachinclude one or more specifications for how video should be included,such as frame rate, resolution, color/grey scale, bit rate, and thelike. Segmentation module 112 segments the video image into one or moretiles representing portions of the video image for encoding using thefirst parameters and one or more tiles representing portions of thevideo image for encoding using the second parameters.

It should be understood that video image, as used herein, includes asequence of frames of the video, as opposed to a single frame. Thus,when the video image is segmented into tiles, each tile is positioned inthe same place for each frame that produces the video image and,therefore, span the multiple frames that comprise the video stream.

Once segmented into tiles, the first parameter tiles are encoded usingthe first parameters in first parameters encoding module 113 and thesecond parameters tiles are encoded using the second parameters encodingmodule 114. In other words, instead of the entire video image beingencoded as a whole using a single set of parameters, the tiles areencoded, which allows different tiles to be encoded using differentparameters. Both encoding modules use the same codec to encode the tilesbut are differentiated in that they use the first and second parameters,respectively, for that encoding. Moreover, since each encoding modulemay process multiple tiles for the image, each encoding module mayencode tiles in parallel using multiple processors or processor cores.After encoding, combination module 115 combines the encoded tiles backinto a single encoded video image and outputs the resultant compositevideo stream.

The parameters of the composite video stream include the highest qualityparameters between the first and second parameters even though some ofthe image is not necessarily encoded with those highest qualityparameters. In other words, the composite video stream comprises a videoin the format produced by the codec having the higher quality parametersbetween the first and second parameters. Using the example from above,the composite video has the higher frame rate of the captured videowhile having the resolution of the other visual information, althoughother examples may have additional parameters affecting quality.Moreover, since the composite video stream generated by operationalscenario 100 looks like any other video generated by the codec, thecomposite video stream can be decoded by a conventional decoder for thatcodec. However, essentially reversing the encoding process by decodingthe video stream in tiles, as described below, allows a decoding systemor device to use less processing power to decode a high resolution, highframe rate video stream than would a conventional decoder.

The composite video stream may be stored or may be transferred over acommunication network, as a single file or streamed during real timecommunications or other streaming video distribution. Accordingly, itshould be understood that the term video stream as used herein refers tothe stream of video passing through encoder 101 for encoding (or decoder301 below for decoding) but that video stream is not necessarilytransferred as streaming video.

FIG. 2 illustrates a method 200 for encoding a composite video streamusing different parameters. Method 200 includes identifying a firstportion of an image of a video stream for encoding using firstparameters and a second portion of the image of the video stream forencoding using second parameters (step 201). The first and secondportions may be identified in information included with the video stream(e.g. designating the portions of the video image) from the source (e.g.compiler) of the video stream, the portions may be input into theencoder independently (e.g. a captured video may be independent of theother visual information that will make up the composite video), thevideo image may be processed to differentiate the first and secondportions, or by any other means by which different image portions may beidentified.

Once the first and second portions are identified, method 200 segmentsthe first portion of the image into one or more first tiles and thesecond portion of the image into one or more second tiles (step 202).Depending on the codec used, the tiles may comprise rectangular blockshaving sides that comprise lines crossing the entire width and height ofthe video image, although, other codecs may use different tile shapes.Once the tiles are segmented, method 200 encodes the first tiles usingthe first parameters and the second tiles using the second parameters(step 203). As noted above, the parameters may include anything thataffects the quality of the encoded video.

After the encoding, the first tiles and the second tiles are combinedinto the composite video stream (step 204). The composite video streamis a video having the higher quality parameters of the first and secondparameters. When combining, step 204 may blend the first and secondencoded tiles in the time domain and the spatial domain. For example, inthe time domain, if the first tiles have a higher frame rate than thesecond tiles, then the portion of the video image made up of the secondtiles is not updated each frame as are the first tiles having the higherframe rate. In the spatial domain, the lower resolution tiles may beplaced within a portion of the higher resolution composite video imagehaving a resolution corresponding to that lower resolution (e.g. a tilecomprising a resolution of 1920×1080 pixels may take up that same amountof pixels within a 3840×2160 composite video image), although, someexamples may employ scaling of the lower resolution tile.

It should be understood that the operation and methods described aboveand below can be performed on video images having portions correspondingto more than two sets of parameters. For example, a video image may beseparated in to four separate tile sets with each set having itsrespective first, second, third, and fourth parameters. Each of the fourtile sets are therefore encoded, and decoded, using their respectiveparameters.

FIG. 3 illustrates a representative encoding operational scenario 300 inwhich a composite video stream is segmented for decoding. In operationalscenario 300, a video decoder 301 is provided, which includes imagesegmentation module 311, first parameters decoding module 312, secondparameters decoding module 313, and combination module 314. Modules311-314 may be implemented by software, firmware, or other type ofprocessing instructions, that direct a processor system to perform thefunctionality disclosed herein, may be implemented in dedicated hardwarecircuitry, or may be implemented in some combination thereof.

While a composite video stream generated in accordance with theembodiments above is able to be decoded by a conventional decoder usingthe same codec, decoding tiles using decoder 301 in a manner similar tothe way the video was encoded allows the composite video stream to bedecoded using less processing resources with one or more decoders.

In operational scenario 300, the composite video is fed into decoder 301and image segmentation module 311 separates the composite video imageinto the first and second tiles. Module 311 may differentiate the tilesbased on information describing the tiles that was generated by theencoder when encoding the composite video stream and received along withthe composite video stream. For example, such information may betransferred in a supplemental enhancement information (SEI) message,which is part of the H.265 video codec, however other means oftransferring the information may also be used.

After segmenting the composite video stream, the first tiles are decodedin first parameters decoding module 312 based on the same firstparameters in which the first tiles were encoded. Likewise, the secondtiles are decoded in second parameters decoding module 313 based on thesame parameters in which the second tiles were encoded. Thus, instead ofdecoding the entire composite video image based on the parameters of thefully compiled composite video stream, each image tile is decoded usinga decoder catered for the parameters of that tile. Upon completion ofdecoding, the decoded tiles are combined in combination module 314, bothspatially and temporally, for output to a display.

It should be understood that, as with the encoder above, the compositevideo stream decoded by decoder 301 may include tiles with decodingparameters in addition to the first and second parameters described inthis embodiment. In those scenarios, decoder 301 may selectively choosewhich tile sets are decoded using their individual parameters ratherthan parameters for the video as a whole. This selective decoding stillrequires a minimum set of tiles to be decoded such that synchronizationwith the encoder is maintained.

FIG. 4 illustrates a method 400 for decoding a composite video streamusing different parameters. Method 400 includes receiving the compositevideo stream (step 401). The composite video stream may be received overa communication network via a network communication interface or may bereceived from a local storage medium. The composite video stream is thensegmented into the first and second tiles (step 402). As noted above,the tiles may be identified based on information received with thecomposite video stream. After segmenting, the first tiles are decodedbased on the first parameters and the second tiles are decoded based onthe second parameters (step 403). After decoding, method 400 combinesthe first tiles and the second tiles for display of the image of thevideo stream (step 404).

FIG. 5 illustrates a multi-client operational scenario 500 in which acomposite video stream is encoded and decoded. Operational scenario 500provides client device 501, client device 502, and communication network503. Client device 501 includes display 511, video encoder 512, andcommunication interface 513. Client device 502 includes display 521,video decoder 522, and communication interface 513. Communicationnetwork includes comprises network elements, such as switches, wirelessaccess nodes, Internet routers, network gateways, application servers,computer systems, communication links, or some other type ofcommunication equipment—including combinations thereof.

In operational scenario 500, video is streamed from client device 501 toclient device 502 as part of a real time communication. The clientdevices 501 and 502 may be personal computers, tablets, smartphones,video conference end point, or any other type of computingdevice—including combinations thereof. The real time communication maybe a video conferencing session or some other form of real timepresentation including video streaming. In particular, the real timecommunication includes both live video captured of presenters andinformation of an interactive whiteboard. Scenario 500 shows the livevideo positioned over the whiteboard on display 521, however, display521 need not present the video and whiteboard in this manner beforeencoding and transmission to client device 502. Rather, display 521 maysimply display the whiteboard with which the user(s) of client device501 interact and may handle the live video in the background.

In step 1, the video stream is sent through to encoder 512 for encoding.Encoder 101 may be an example of encoder 512 in client device 501. Thevideo may be sent to the video already compiled into a video image (i.e.with the live video overlaid on the whiteboard) or the video may betransferred to encoder 512 in separate components (i.e. the white boardand the live video distinct from one another). In this example, encoder512 is configured to encode the whiteboard in 4K resolution at 3.5frames per second (fps) and the live video in full HD 1080P at 30 fps.These parameters may be defined by a user, may be determined based onthe resolution and frame rate of the video components (i.e. thewhiteboard data and the live video) as received by encoder 512, or bysome other means. The whiteboard in this example will be encoded at ahigher resolution because more visual detail may be present in thewhiteboard's content while the live video may be captured only at fullHD resolution (or otherwise down converted for the purposes of the realtime communication session). Similarly, the whiteboard may be updatedinfrequently or not at all, which allows for a low frame rate whenconverting its visual contents into a video stream. In other words, thewhiteboard content is merely visual data and does not require a highframe rate when converted to a video stream. In contrast, live videorequires a higher frame rate in order for the human eye to perceive thatthe sequence of images making up the live video are representing fluidmotion.

To conserve the amount of data and processing resources used to encodethe video stream, encoder 512 identifies the whiteboard portions of thevideo image and the live video portions of the video image and segmentsthose portions into tiles. In this example, encoder 512 uses the H.265video codec which allows for separating a video image into tiles,although other codecs having segmenting capabilities may also be used.

FIG. 6 illustrates an example of image segmentation scenario 600 inoperational scenario 500. The video image is segmented into 9 tiles.H.265 requires that the lines defining the tiles span the entire image,however, other codecs may not have such a requirement. For example, inother codecs, tiles 6-8 may be combined into one tile. Encoder 512determines that tiles 1-8 are part of whiteboard content that should beencoded in 4K at 3.5 fps and the tile 9 is part of the live video thatshould be encoded in full HD at 30 fps. Therefore, encoder 512 encodeseach of tiles 1-8 independently in 4K at 3.5 fps and encodes tile 9 infull HD at 30 fps. While image segmentation scenario 600 keeps thespatial orientation of the tiles intact to show how the whiteboardcontent is separated from the live video, this spatial orientation isirrelevant when the tiles are encoded since the tiles are encodedindependently. For instance, tile 5 is encoded just as though imageframes of that tile make up the entire video image and that video imagehas a resolution equal to tile 5's portion of the full 4K image at 3.5fps (e.g. tile 5 may comprise a 256×1080 portion of the 4K image and istherefore encoded as a 256×1080 video). After encoding, the tiles arecombined spatially and temporally back into a single video image asH.265 video in 4K resolution at 30 fps.

FIG. 7 illustrates an image frame sequence 700 when combining theencoded tiles in operational scenario 500. Frame sequence 700illustrates how frames of a video stream are temporally combined.However, for clarity of illustration, the frames of FIG. 7 do notrepresent the total number of frames present for 30 fps video.

Sequence 700 includes 5 frames of the video stream numbered F0-F5,wherein the frame numbers correspond to the order in which the frame isin a sequence of video frames. The tiles have already been spatiallyplaced in their proper positions to reproduce the original video image.As shown, all of frames F0-F5 include information updating the image ofthe live video. Each frame refers back to a previous frame, as is commonduring video coding, since the differences between frames use less datathan an entire frame. In particular, the frames each correspond to atemporal layer. Frame F0 is of layer T0 because frame F0 is referred to,either directly or indirectly, by frames F1-F4.

Frame F4 is similarly of layer T0 since four frames, which are notillustrated, also refer to frame F4. Frame F2 is of layer T1 since it isreferred to by only one other frame, F3. And finally, Frames F1 and F3are of the highest layer, T2, since no other frames refer to either offrames F1 or F2.

Unlike the live video, the whiteboard information is only updated atframes F0 and F4 since the white board information has a much lowerframe rate and, therefore, does not require updates each time the livevideo is updated. As noted above, in a true 30 fps verses 3.5 fpsencoding, there would exist more than 3 frames in between F0 and F4where only the live video frames are updated. Regardless, whentemporally combining the tiles to produce a 4K at 30 fps H.265 videostream, the frames that do not include whiteboard frame updateinformation include information that indicates the whiteboard part ofthe image frame has not changed, which uses a negligible amount of data.Thus, even the frames that do not update the whiteboard informationstill comprise enough information to generate a full 4K resolution imagethough the whiteboard portion of the frame merely indicates thatprevious image information should still be used.

Referring back to operational scenario 500, after encoding by encoder512, the encoded composite video stream passes to communicationinterface 513 at step 2. Encoder 512 further passes an SEI message thatindicates information about the tiles that make up the composite videostream, such as tile positioning within the image and the parameterswith which each tile is encoded. Communication interface 513 thentransfers the composite video stream and the SEI message overcommunication network 503 to communication interface 523 of clientdevice 502 in step 3. It should be understood that additional clientdevices may also be participating in the video conference and maytherefore also receive the composite video and SEI message. If theadditional client devices do not support SEI messages, the additionalclient devices simply decode the received composite video stream as theywould normally decode a video stream in the same format. Similarly, ascomputing system, such as a conferencing server, may facilitate thedistribution of video between client devices.

Step 4 passes the composite video stream and SEI message to decoder 522.Decoder 522 may be a conventional H.265 decoder that decodes thecomposite video stream as standard 4K at 30 fps stream. However,ideally, decoder 522 is an example of decoder 301 and will reverse theencoding process to more efficiently decode the composite video. Thatis, decoder 522 uses the SEI message information to separate the imageof the composite video stream into tiles 1-9 as shown in FIG. 6. Thenthe whiteboard tiles 1-8 are individually decoded at 4K and 3.5 fps andthe live video is decoded in full HD at 30 fps. After decoding the tilesare recombined to display the video image on display 521 at step 5.

It should be understood that operational scenario 500 is performedcontinually during the video conference because, as live video iscontinually captured and as white board updates occur, the resultingvideo must continually be encoded and transferred to other clientdevices for display in real time. Separating the video image into tilesfor encoding uses processing resources more efficiently since tiles thatdo not require more processing for higher quality (e.g. higher framerate or higher resolution) are not unnecessarily processed in thatmanner. This allows a high resolution and high frame rate video streamto be decoded on devices and systems that have more limited processingpower. Likewise, less data is used for the video since lower qualityaspects of an encoded video file require less information.

FIG. 8 illustrates a computing architecture 800 that may be used toimplement a video encoder system, decoder system, or both. Computingarchitecture 800 is an example implementation of encoder 101 or decoder301, although encoder 101 and decoder 301 may use alternativeconfigurations. Similarly, computing architecture 800 is an exampleimplementation of client devices 501 and 502, although devices 501 and502 may use alternative configurations. Computing architecture 800comprises communication interface 801, user interface 802, andprocessing system 803. Processing system 803 is linked to communicationinterface 801 and user interface 802. Processing system 803 includesprocessing circuitry 805 and memory device 806 that stores operatingsoftware 807.

Communication interface 801 comprises components that communicate overcommunication links, such as network cards, ports, RF transceivers,processing circuitry and software, or some other communication devices.Communication interface 801 may be configured to communicate overmetallic, wireless, or optical links. Communication interface 801 may beconfigured to use TDM, IP, Ethernet, optical networking, wirelessprotocols, communication signaling, or some other communicationformat—including combinations thereof. Communication interface 801 maybe omitted in some examples.

User interface 802 comprises components that interact with a user. Userinterface 802 may include a keyboard, display screen, mouse, touch pad,or some other user input/output apparatus. User interface 802 may beomitted in some examples.

Processing circuitry 805 comprises microprocessor and other circuitrythat retrieves and executes operating software 807 from memory device806. Memory device 806 comprises a non-transitory storage medium, suchas a disk drive, flash drive, data storage circuitry, or some othermemory apparatus. Operating software 807 comprises computer programs,firmware, or some other form of machine-readable processinginstructions. Operating software 807 includes tile segmentation module808 and codec module 809. Operating software 807 may further include anoperating system, utilities, drivers, network interfaces, applications,or some other type of software. When executed by circuitry 805,operating software 807 directs processing system 803 to operatecomputing architecture 800 as described herein.

In particular, to operate as an encoder of a composite video stream,tile segmentation module 807 directs processing system 803 to identify afirst portion of an image of a video stream for encoding using firstparameters and a second portion of the image of the video stream forencoding using second parameters, and to segment the first portion ofthe image into one or more first tiles and the second portion of theimage into one or more second tiles. Codec module 809 directs processingsystem 803 to encode the first tiles using the first parameters and thesecond tiles using the second parameters and, after the encoding, tocombine the first tiles and the second tiles into the composite videostream.

Additionally, to operate as a decoder of a composite video stream, tilesegmentation module 807 directs processing system 803 to receive thecomposite video stream and segment the composite video stream into thefirst and second tiles. After segmenting, codec module 809 directsprocessing system 803 to decode the first tiles based on the firstparameters and decoding the second tiles based on the second parametersand, after decoding, to combine the first tiles and the second tiles fordisplay of the image of the video stream.

The above description and associated figures teach the best mode of theinvention. The following claims specify the scope of the invention. Notethat some aspects of the best mode may not fall within the scope of theinvention as specified by the claims. Those skilled in the art willappreciate that the features described above can be combined in variousways to form multiple variations of the invention. As a result, theinvention is not limited to the specific embodiments described above,but only by the following claims and their equivalents.

What is claimed is:
 1. A method of encoding a composite video stream,comprising: identifying a first portion of an image of a video streamfor encoding using first parameters and a second portion of the image ofthe video stream for encoding using second parameters; segmenting thefirst portion of the image into one or more first tiles and the secondportion of the image into one or more second tiles; encoding the firsttiles using the first parameters and the second tiles using the secondparameters; and after the encoding, combining the first tiles and thesecond tiles into the composite video stream.
 2. The method of claim 1,wherein the first parameters comprise a first frame rate and a firstresolution and the second parameters include a second frame rate and asecond resolution.
 3. The method of claim 2, wherein the first framerate is higher than the second frame rate and the first resolution islower than the second resolution.
 4. The method of claim 3, wherein thecomposite video stream comprises a video having the first frame rate andthe second resolution.
 5. The method of claim 1, further comprising:transferring the composite video stream to a decoder system, wherein thedecoder system decodes the composite video stream for display.
 6. Themethod of claim 5, wherein the decoder system segments the compositevideo stream, based on tile segment information, into the first andsecond tiles before decoding the first tiles based on the firstparameters and decoding the second tiles based on the second parameters.7. The method of claim 6, further comprising: transferring the tilesegment information to the decoder system using a supplementalenhancement information (SEI) message.
 8. The method of claim 1, whereinthe encoding is performed using the H.265 codec.
 9. A non-transitorycomputer readable medium having instructions stored thereon that, whenexecuted by an encoder system, direct the encoder system to perform amethod of encoding a composite video stream, the method comprising:identifying a first portion of an image of a video stream for encodingusing first parameters and a second portion of the image of the videostream for encoding using second parameters; segmenting the firstportion of the image into one or more first tiles and the second portionof the image into one or more second tiles; encoding the first tilesusing the first parameters and the second tiles using the secondparameters; and after the encoding, combining the first tiles and thesecond tiles into the composite video stream.
 10. The non-transitorycomputer readable medium of claim 1, wherein the first parameterscomprise a first frame rate and a first resolution and the secondparameters include a second frame rate and a second resolution.
 11. Thenon-transitory computer readable medium of claim 10, wherein the firstframe rate is higher than the second frame rate and the first resolutionis lower than the second resolution.
 12. The non-transitory computerreadable medium of claim 11, wherein the composite video streamcomprises a video having the first frame rate and the second resolution.13. The non-transitory computer readable medium of claim 9, wherein themethod further comprises: transferring the composite video stream to adecoder system, wherein the decoder system decodes the composite videostream for display.
 14. The non-transitory computer readable medium ofclaim 13, wherein the decoder system segments the composite videostream, based on tile segment information, into the first and secondtiles before decoding the first tiles based on the first parameters anddecoding the second tiles based on the second parameters.
 15. Thenon-transitory computer readable medium of claim 14, wherein the methodfurther comprises: transferring the tile segment information to thedecoder system using a supplemental enhancement information (SEI)message.
 16. The non-transitory computer readable medium of claim 9,wherein the encoding is performed using the H.265 codec.
 17. Anon-transitory computer readable medium having instructions storedthereon that, when executed by an decoder system, direct the decodersystem to perform a method of decoding a composite video stream, themethod comprising: receiving the composite video stream, wherein thecomposite video stream comprises a combination of one or more firsttiles of an image of a video stream encoded using first parameters andone or more second tiles of the image of the video stream encoded usingsecond parameters; segmenting the composite video stream into the firstand second tiles; after segmenting, decoding the first tiles based onthe first parameters and decoding the second tiles based on the secondparameters; and after decoding, combining the first tiles and the secondtiles for display of the image of the video stream.
 18. Thenon-transitory computer readable medium of claim 17, wherein the methodfurther comprises: receiving tile segment information associated withthe composite video stream; and wherein segmenting the composite videostream is performed based on the tile segment information.
 19. Thenon-transitory computer readable medium of claim 17, wherein the firstparameters comprise a first frame rate and a first resolution and thesecond parameters include a second frame rate and a second resolution.20. The non-transitory computer readable medium of claim 19, wherein thefirst frame rate is higher than the second frame rate and the firstresolution is lower than the second resolution.